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ABSTRACT 


The  well-known  a(3  filter  achieves  its  best  performance  when  measurement 
outliers  are  rare.  Two  modifications  of  the  a/3  filter  are  presented  for  improving 
its  performance  when  outliers  are  commonplace.  The  a/3^  filter  uses  a  drag 
coefficient  to  ensure  that  the  target  motion  model  has  a  finite  asymptotic  velocity 
variance.  Although  the  a/3fj,  filter  is  less  sensitive  to  outliers  than  the  a  (3  filter, 
it  does  not  directly  address  the  problem  of  large  spurious  measurements.  The 
a(3/j,a  filter  is  a  probabilistically  based,  nonlinear  target  estimation  filter  that  is 
much  more  robust  against  outliers  than  the  a/3  filter.  To  facilitate  understanding 
of  the  filter  optimization  problem  and  to  enable  computation  of  filter  outputs 
crucial  to  the  post-processing  task  of  associating  collections  of  tracking  filters,  it 
is  recommended  that  the  a (3  filter,  together  with  all  the  important  earlier  work 
that  has  made  it  useful  in  applications,  be  reinterpreted  in  the  Kalman  filter 
paradigm. 
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1.  INTRODUCTION 


The  a/5  tracking  filter  for  estimating  target  position  and  velocity  from  position 
measurements  pre-dates  the  Kalman  filter  by  several  years;  see,  for  example, 
the  classic  papers  [1]  and  [2].  Implicit  in  its  design  is  the  assumption  that  mea¬ 
surement  outliers  are  rarely  encountered.  While  such  an  assumption  may  be 
appropriate  at  sufficiently  high  signal-to- noise  ratio  (SNR),  it  is  clearly  not  ap¬ 
propriate  when  the  SNR  is  low  enough  that  outliers  axe  commonplace.  Tracking 
filters  for  low  SNR  targets  must  therefore  be  designed  with  some  provision  to 
accommodate  measurement  outliers.  Two  simple  modifications  of  the  a/5  filter 
are  proposed  here  for  improving  its  performance  when  measurement  outliers  are 
a  frequent  occurrence. 

The  first  modification  of  the  a/5  filter  is  the  use  of  a  drag  coefficient,  denoted 
by  //,  to  modulate  the  velocity  in  the  prediction  equation.  The  resulting  tracker, 
called  herein  the  a/5//  tracker,  is  less  sensitive  to  outliers  than  the  standard  a/5 
filter.  It  is  discussed  in  Section  2.  The  a/5//  tracker  reduces  to  the  a/5  tracker 
when  ii  =  l. 

The  second  modification,  discussed  in  Section  3,  extends  the  a/5//  tracker  to 
include  a  probabilistic  data  association  (PDA)  technique  to  estimate  a  weighted 
combined  measurement  innovation  (called  the  “error”  or  “residual”  in  the  usual 
language  of  the  a/5  filter)  to  use  in  the  position  and  velocity  update  equations. 
The  resulting  filter,  called  the  a/3/icr  filter,  is  more  robust  to  tracking  in  the 
presence  of  outliers  than  the  standard  a/5  filter;  see  [3]  and  [4]  for  examples. 

It  is  recommended  in  Section  4  that  the  a/5  filter,  together  with  the  important 
developmental  work  that  has  made  it  useful  in  applications,  be  reinterpreted 
in  a  completely  probabilistic  paradigm  such  as  that  of  the  Kalman  filter.  A 
probabilistic  theoretical  foundation  is  necessary  because  it  greatly  facilitates 
understanding  the  parameter  optimization  problems  of  the  a/5,  a/5//,  and  a/5//cr 
filters,  and  because  it  enables  the  computation  of  the  statistically  meaningful 
filter  outputs  essential  to  the  task  of  correctly  associating  collections  of  tracks 
—  a  task  performed  in  the  post-processor  downstream  of  the  tracking  filters. 

2.  THE  <xp  FILTER  WITH  VELOCITY  DRAG 

The  discussion  throughout  is  restricted  to  one  dimensional  position  measure¬ 
ments,  but  extension  to  the  general  measurement  case  is  conceptually  straight¬ 
forward.  Let  :r0|o  and  fr0|o  be  given  initial  estimates  of  target  position  and  ve- 
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locity,  and  let  T  denote  the  interval  between  successive  measurements.  For 
k  =  1, 2, the  standard  a/3  tracking  filter  prediction  equations  are  given  by 


Xk>\k-i  —  Xk-\\k-i  +  T  Ufc-iifc-i ,  (2.1) 

vk\k-i  =  Vk-  I|fc-1  •  (2.2) 

Let  zk  denote  the  measured  position  at  time  kT.  The  a/3  filter  estimation  equa¬ 
tions  at  time  kT  are  given  by 

Xk\k  *Efc|fc— 1  “t-  &k(Zk  l))  (2.3) 

Vk\k  Vk\k—  1  “I"  Z^i^k  —  Xk\k— 1)?  (2-4) 


where  the  coefficients  ak  and  (3k  are  tracking  system  gains.  These  system  gains 
are  either  given  or  computed  adaptively  using  information  that  is  statistically 
independent  of  target  position  and  velocity,  e.g.,  estimated  SNR  at  time  kT. 

The  a(3/i  tracking  filter  is  the  a(3  filter  modified  by  a  drag  coefficient  in  the 
velocity  prediction  equation  (2.2).  Specifically,  it  is  assumed  that 


1  M  Vk— i|fc— I?  (2.5) 

where  p  is  chosen  so  that  0  <  p  <  1.  The  standard  a/3  filter  is  the  special  case 
p  =  1.  The  a/3p  filter  was  first  proposed  in  [5].  (For  a  related  discussion  on 
exponential  data  weighting  and  asymptotic  stability  in  a  Kalman  filter  setting, 
see  [6,  Section  6.2]).  The  a(3p  filter  is  useful  in  situations  where  velocity  esti¬ 
mates  are  biased  high  due  to  extraneous  effects  such  as  outliers  (i.e.,  spurious 
measurements  unrelated  to  the  target).  Drag  coefficients  n  <  1  result  in  a  finite 
asymptotic  velocity  variance  with  additive  Gaussian  errors,  as  is  shown  below, 
at  the  cost  of  introducing  lag  into  the  filter  estimates. 

The  Gauss-Markov  dynamical  target  velocity  model  implicit  in  (2.5)  is 

vk  =  pt  vk-i  +T  wk,  (2.6) 

where  {wk}  is  a  sequence  of  independent,  identically  distributed  Gaussian  ran¬ 
dom  accelerations.  Denote  the  variance  of  these  random  accelerations  by  a2 

J  acc 

The  second  moment  of  vk  is,  using  (2.6)  and  the  independence  of  wk  and  vk-i, 

E[(vk)2}  =  E[{ p  Vk~i  +T  Wk)2] 

=  p2E[(vk^)2]+T2a  L-  (2.7) 
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It  follows  by  induction  from  (2.7)  that,  for  k  >  1, 


E[(vkf]  =  E[{vQf)+T^lJz^ 

3= 0 

=  SkE[(vo  )2]+T^lcC{~^y  (2-8) 

Similarly,  E[vk]  =  ^E[vk- 1],  and  E[vk]  =  fikE[v 0].  Squaring  and  subtracting  from 
(2.8)  gives  the  variance  of  vk  in  terms  of  fj,  and  the  variance  of  v0.  Explicitly, 

Var[vk\  =  E[(vk)2}  -  (. E[vk ])2 

=  ^Var[v0]+T2alc(^-^y  (2-9) 

Taking  the  limit  in  (2.9)  as  k  — ►  oo  gives,  since  fj,  <  1, 

T^rr2 

a2vd  slim  Var[vk\  =  - — s*f.  (2.10) 

fc-+oo  1  —  [A* 

From  (2.10)  it  is  clear  that  cr^  — ►  oo  as  fi  — ►  1;  hence,  the  case  p  =  1  results 
in  dynamical  models  that  are  physically  unrealistic  for  targets  of  finite  mass. 
Solving  (2.10  )  for  p  gives 


/r  = 


(2.11) 


The  variance  ratio  ( Toacjavei )2  is  dimensionless. 

If  the  asymptotic  velocity  variance  cr^  is  specified,  the  standard  ad  filter  is 
recovered  approximately  for  small  random  accelerations.  Table  1  supports  this 
statement  numerically. 

A  more  general  analysis  with  nonstationary  drag  coefficients  {fik}  is  poten¬ 
tially  of  interest,  especially  if  fj,k  is  to  be  estimated  adaptively  as  a  function  of 
target  type  or  estimated  SNR. 
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Table  1.  Evaluation  of  Equation  (2.11) 

T C  gcc  /  (7  vej 


0.436 

0.9 

0.600 

0.8 

0.714 

0.7 

0.800 

0.6 

0.866 

0.5 

0.917 

0.4 

0.954 

0.3 

0.980 

0.2 

0.995 

0.1 

3.  THE  FILTER  WITH  PDA 

The  a(3  and  ap/i  filters  do  not  estimate  a  covariance  matrix  corresponding  to 
their  state  estimates  (2k\k,  Vk\k)-  They  also  do  not  require  the  specification  of 
measurement  covariances.  These  statistically  significant  oversights  are  easily 
rectified  by  reformulating  the  a(5  filters  as  Kalman  filters;  however,  such  refor¬ 
mulations  lose  some  of  the  historically  important  computational  economy  of  the 
original  filter. 

A  more  serious  problem  is  that  the  a(3  filter  performs  poorly  in  environ¬ 
ments  that  experience  outliers.  Rectifying  this  problem  requires  modifying  the 
basic  structure  of  the  filter.  The  PDA  technique  is  designed  for  tracking  in  the 
presence  of  outliers  using  the  Kalman  filter  [7,  Chapter  3];  however,  the  assump¬ 
tions  of  the  Kalman  filter  are  unnecessarily  restrictive.  The  proposed  technique 
respects  the  computational  economy  of  the  original  a(3  filter  and  avoids  the 
Kalman  filter  formalism.  Discussion  and  examples  of  the  PDA  ct(3  filter  axe 
given  in  [3]  and  [4]. 

Measurements  are  typically  generated  in  practice  by  thresholding  a  sensor 
output  “image”  comprising  a  finite  number  of  pixels,  or  cells,  to  locate  the  output 
“peaks.”  Thresholds  may  be  fixed  or  chosen  adaptively  to  match  the  estimated 
background  noise,  and  they  may  vary  from  cell  to  cell.  Preprocessors  typically 
refine  these  initial  peaks  in  some  way  (e.g.,  beam  interpolation)  to  improve 
estimates  of  their  location,  but  the  end  result  is  conceptually  unchanged  —  the 
estimated  locations  of  sufficiently  large  (i.e.,  above  threshold)  local  peaks  of  the 
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sensor  output  make  up  the  measurement  set  used  for  tracking.  Spurious  local 
peaks  (i.e.,  measurements)  unrelated  to  the  target  are  generated  by  noise  at  the 
sensor  input,  a  problem  that  is  greatly  exacerbated  when  the  target  SNR  is  low. 
Consequently,  all  practical  trackers  must  provide  some  mechanism  for  treating 
measurement  outliers. 

In  the  PDA  Kalman  filter,  an  innovation  variance  a2nn  is  updated  during  the 
prediction  phase  of  the  tracking  filter.  For  the  a/3/ia  filter,  a2nn  is  assumed  to 
be  a  known  function  of  a,  (3,  and  g  and  so  is  independent  of  k  (unless  a ,  /?,  and 
g  are  estimated  adaptively).  The  nominal  validation  gate  is  the  interval 

G  —  1  9&inni  3'k\k— 1  "b  9^inn\i 

where  g  >  0  is  a  specified  constant  called  the  normalized  gate  width  parameter. 
In  practice,  however,  the  gate  width  must  be  adjusted  slightly  from  scan  to  scan 
to  accommodate  the  changing  values  of  xk\k_i  relative  to  the  fixed  cell  structure 
of  the  sensor  output.  The  exact  validation  gate  is  defined  by  the  interval 

G[k)  =  [3?fc|fc—  i  Qk^innt  %k\k— 1  “t"  9k&inn\i  (3.1) 

where  the  normalized  gate  width  parameter  gk>  0  is  computed  from  xk\k-i  and 
the  sensor  cell  structure  so  that  gk  ~  g.  Let  m(k)  >  1  denote  the  number  of 
measurements  of  target  position  at  time  kT,  and  let 


{zk(l),zk(2),...,zk{m(k))}  (3.2) 


denote  the  measurements  themselves.  Measurements  lying  outside  the  gate  G(k) 
are  deleted;  hence,  it  is  assumed  that  all  the  measurements  (3.2  )  lie  within  the 
gate  G(k).  Let  Af(x;a,a2)  denote  the  Gaussian  density  function  with  mean  a 
and  variance  a2  evaluated  at  the  point  x,  and  let 


pG(k)  = 


%k\k— 1 )  ® inn)  ^X 


(3.3) 


denote  the  probability  that  a  target  measurement  falls  in  the  gate.  Thus, 
Pa{k)  1  as  gk  -»  oo.  Let 


e« 


£o 


exp 


zkii)  -xk\k- 1 


l  2 ' 


Q(k)  \  ( 1  -  PpPG(k)\ 
9k  {  Pd  ) 


i  =  1, ...,  m(k), 


(3.4) 

(3.5) 
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where  Pd  is  the  probability  that  a  measurement  of  the  target  is  in  the  mea¬ 
surement  set  (3.2),  and  m(k)G ^  is  the  expected  number  of  measurements  that 

fall  within  the  gate.  Evaluating  m(k)G ^  requires  a  statistical  model  of  outliers, 
which  may  not  be  available.  Another  approach  is  simply  to  put  s0  =  m(k).  In 
practice  it  is  probably  preferable  to  take  e0  as  a  free  design  parameter  that  is 
either  specified  in  advance  or  estimated  adaptively. 

Compute  the  conditional  probabilities 


7 U  ----- 


7T0  = 


£o  + 


i  —  1,  ...,m(k), 


£o 


£o  +  £j 


and  compute  the  weighted  combined  innovation 

m(fc) 

Vk=  (' 2 k(})  ~Xk\k-l)  ■ 

i=  1 

The  prediction  equations  of  the  a/3/j,cr  filter  are 


(3.6) 

(3.7) 


(3.8) 


Xk\k-i  —  Sfc-iifc-i  +  T  ,  (3.9) 

^  (3.10) 

and  the  estimation  equations  of  the  af3^a  filter  are 

%k\k  %k\k— 1  “b  O-k  Z/fc,  (3.11) 

(3k  , 

Vk\k  =  Vk\k-\  +  —  I'fc.  (3-12) 


The  prediction  equations  (3.9)-(3.10)  are  identical  to  the  apfi  filter,  and  the 
estimation  equations  (3.11)-(3.12)  are  altered  from  those  of  the  afin  filter  by 
the  use  of  a  weighted  innovation  over  the  m{k)  measurements.  The  nonlinear 
innovation  is  crucial  to  the  ability  of  the  aft/icr  filter  to  handle  outliers. 

An  important  special  case  is  m(k)  =  1,  k  =  1,2,...,  that  is,  when  there 
is  exactly  one  measurement  at  every  time  kT.  From  (3.6),  (3.4)  and  (3.8),  it 
follows  that  the  weighted  combined  innovation  is 


Vk 


Zk{  1)  ~  gfcjfc- 1 


1  +  So  exp 
K  (zfc(l)  -£jfc|fc-l)  , 


&  inn 


T  2 


(3.13) 

(3.14) 
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(3.15) 


where  the  innovation  gain  k  is,  for  general  real  values  u, 

k  —  k(u)  — - -7- 

1  +  £0  exp  <  \ 

It  is  evident  from  (3.13)  that  i/k  — *  0  as  the  unweighted  measurement  innovation 
2jt(l)  —  Xk\k-i  goes  to  oo.  Thus,  the  a/3ga  filter  does  not  reduce  to  the  afty  filter 
because  of  the  possibility  that  the  given  measurement  is  an  outlier  and  does  not 
actually  originate  from  the  target.  Prom  (3.13)  and  (3.5),  it  also  follows  that 
the  a/3na  filter  reduces  to  the  a/3 g  filter  when  m(k)  =  1  only  if  the  product 
Pd PG(k)  =  1,  that  is,  only  in  the  limit  as  the  gate  width  parameter  g  —>  oo  and 
Pd  =  1-  The  result  (3.13)  is  important  in  applications  because  it  shows  that 
less  than  the  full  measurement  innovation  should  be  used  in  the  a(3  filter  when 
there  exists  an  element  of  doubt  (e.g.,  PD  <  1)  as  to  the  correct  origin  of  the 
measurement. 


The  general  shape  of  the  innovation  gain,  or  weighting  function,  k(u)  over 
the  gate  G(k)  is  controlled  by  the  parameters  e0  and  ainn.  Let  a  =  k(0)  denote 
the  peak  value,  and  let  b  —  K(gk<7inn)  denote  the  value  at  the  edge  of  the  gate. 
Clearly,  0  <  a  <  1  and  0  <  b  <  1.  Using  (3.15)  and  solving  these  two  equations 
for  e0  and  the  normalized  gate  width  parameter  g^  gives 


£o  = 


1  -a 


and 


gk  = 


\ 


2  In 


a  1  — 


1  —  a  b  )' 

For  example,  for  a  =  0.98  and  b  =  0.70,  equations  (3.16)  and  (3.17)  give 


(3.16) 

(3.17) 


£0  =  0.02041  and  gk  =  2.468. 


(3.18) 


Figure  1  plots  k(u)  for  this  example  for  the  special  case  ainn  =  1. 

The  nonlinear  Bayesian  style  ratios  (3.6)  -  (3.7)  are  probably  more  signifi¬ 
cant  than  the  exponential  function  in  (3.4),  so  any  similarly  tapered  function 
can  be  used  without  risking  significant  loss  of  performance.  For  example,  the 
approximation 


_ j.2 72  _  1  -  0.075  x2 

6  ~  1  +  0.625  x2  ’ 

has  an  error  of  at  most  0.0694307  and  the 
specified  interval;  see  Figures  2  and  3. 


\x\  <  3.65148,  (3.19) 

right  hand  side  is  nonnegative  on  the 
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4.  RECOMMENDATIONS 


The  a/3  filter  is  predicated  on  the  assumption  that  the  correct  target  measure¬ 
ment  is  known  and  is  always  correctly  assigned  to  the  filter.  Given  sufficiently 
many  outliers  or  low  enough  SNR,  such  an  assumption  will  lead  to  serious  per¬ 
formance  degradation.  The  greater  the  outlier  density  or  the  lower  the  SNR,  the 
more  often  measurements  unrelated  to  the  target  will  enter  the  filter  and  cause 
it  to  lose  track.  Methods  are  widely  available  in  the  tracking  literature  (see, 
e.g.,  [7])  for  dealing  with  outliers  in  a  tracking  filter;  however,  incorporating 
these  advances  into  the  a/3  filter  requires  switching  to  a  completely  probabilistic 
paradigm  such  as  that  of  the  Kalman  filter. 

It  is  recommended  that  the  a/3  filter,  together  with  all  the  important  work 
that  has  gone  into  making  it  useful  in  applications,  be  reinterpreted  in  the 
Kalman  filter  paradigm.  There  are  several  reasons  for  this  recommendation. 
Firstly,  the  switch  to  a  probabilistic  formulation  greatly  facilitates  understand¬ 
ing  tracking  filter  optimization.  The  important  practical  problem  of  selecting 
the  best  numerical  values  of  the  available  design  parameters  (e.g.,  a,  (3,  /i,  and 
a)  becomes  more  difficult  as  the  number  of  parameters  increases.  Secondly,  a 
probabilistic  formulation  enables  the  computation  of  filter  outputs  (such  as  the 
error  variance  of  the  estimated  track)  that  are  vital  to  the  task  of  associating 
multiple  tracks  —  a  task  performed  in  the  post-processor  downstream  of  the 
tracking  filters.  Track  association  is  a  very  important  problem  in  many  appli¬ 
cations,  and  consistent  statistical  tracker  outputs  are  essential  to  its  solution. 
Finally,  switching  to  a  probabilistic  formulation  promotes  informed  technical 
peer  review  and  enables  advances  in  state-of-the-art  tracking  technology  to  be 
incorporated  more  quickly  into  current  systems  and  applications. 
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Figure  1.  Innovation  gain  k(u)  over  the  gate  G(k)  =  [—2.468,  2. 
=  0.98,  b  —  0.70,  and  ainn  =  1. 


Figure  2.  Left  and  right  hand  sides  of  the  approximation  (3.19) 
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